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13. ABSTRACT ^g pg j- f of 3 larger research project in musical structure, a program has 
been written which "reads" scores encoded in an input language isomorphic to music no- 
tation. The program is believed to be the first of its kind. From a small number of 
parsing rules the program derives complex configurations, each of which is associ.ated 
W'ith a set of reference points in a numerical representation oi a t ime- cont inuum. The 
logical structure of the program is such that all and only the defined classes of events 
are represented in the output. 

15ecauae the basis of the program is syntactic (in the sense that parsing operations 
arc performed on formal structures in the input string), many extensions and refinements 
can be made without excessive difficulty. The program can be applied to any music which 
t:an be represented in the input language. At present, however, it constitutes the first 
stage in the development of a set of analytic tools for the study of so-called atonal 
music . 

Ttie program and the approach to automatic data- st rue turing may be of interest to 
linguists and scholars in other fields concerned with basic studies of complex 
structures produced by human beings. 



Automatic da ta- s t ruct ur ing 

Con-putors 

M.'i ch ine- a id ed cc gn i t i on 



Multiple-access computers 
On-line computers 
Real-time computers 



Syntax- based analysis 
Time-sharing 
Time-shared computers 



DD 



1473 (M.I.T.) 



lyXCLASSIFTED 
Security Classificatit 



MAC-TR-39 



SYNTAX- BASED ANALYTIC READING OF MUSICAL SCORES 



by 



Allen Forte 



Project MAC 
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 



ABSTRACT 

As part of a larger research project in musical structure, a program has 
been written which "reads" scores encoded in an input language isomorphic to 
music notation. The program is believed to be the first of its kind. 

From a small number of parsing rules the program derives complex con- 
figurations, each of which is associated with a set of reference points in a 
numerical representation of a time -continuum. The logical structure of the 
program is such that all and only the defined classes of events are represented 
in the output . 

Because the basis of the program is syntactic (in the sense that parsing 
operations are performed on formal structures in the input string), many ex- 
tensions and refinements can be made without excessive difficulty. 

The program can be applied to any music which can be represented in 
the input language. At present, however, it constitutes the first stage in the 
development of a set of analjdic tools for the study of so-called atonal music, 
the revolutionary and little understood music which has exerted a decisive in- 
fluence upon contemporary practice of the art. 

The program and the approach to automatic data -structuring may be of 
interest to linguists and scholars in other fields concerned with basic studies 
of complex structures produced by human beings. 
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SECTION I 

SCOPE OF THE RESEARCH PROJECT 

During the years 1908-1922 musical composition underwent a remarkable 
and revolutionary transformation. This radical change was due almost entirely 
to the innovations of Arnold Schoenberg and his students, Anton Webern and 
Alban Berg. Among the major compositions of this time were Schoenberg's 
Pierrot Lunaire (1912), Webern 's Six Orchestral Pieces (1910) and Berg's 
Wozzeck (1919). The term "atonal, " commonly used to describe this music, 
is construed as "not exhibiting any of the structural characteristics of traditional, 
tonal music." However inadequate the term may be, it does indicate the fact 
that the structural determinants of this complex music are not well understood. 
No adequate structural description has yet been put forth which can be subjected 
to acceptable logical or pragmatic tests. The score-reading system described 
in this paper is part of a larger research project concerned with the development 
of such a general structural description. 

Certain basic decisions which led to the development of the score-reader 
are intimately connected with the goal of the larger project outlined above. The 
choice of atonal music as the object of research brings with it the need for an 
extensive revision of outlook. For example, conventional descriptive language 
("melody, " "harmony, " "counterpoint") is not very useful, and may even be a 
significant hindrance to new formulations, since it is oriented toward older 
music. When, in addition, problems of structural analysis are stated in terms 
accessible to computer programming, many conventions are set aside and many 
familiar concepts arc rejected after serious scrutiny. These considerations and 
others, discussed below, led ultimately to the design of the basic tool which is 
described in this paper. 



SECTION II 
APPROACH TO A SYNTAX-BASED METHOD 

2.1 THE SCORE AS A SYSTEM OF GRAPHIC SIGNS 

One can obtain information about a particular composition in a number of 
ways: for example, by reading statements made by the composer which purport 
to describe his composing techniques or his aesthetic views, or by asking a 
trained listener to supply a description of some kind. Such information is al- 
ways incomplete , however, and does not provide an adequate base for an analytic 
system. The musical score, on the other hand, constitutes a complete system 
of graphic signs and, properly represented for computer input, may be analyzed 
as a logical image of the unfolding musical events which make up the composi- 
tion. This was the point of departure for the score -reading program: the 
analysis of formal characteristics of sequences of graphic signs in the music 
input language discussed below. 

2.2 ADVANTAGES OF AUTOMATIC DATA-STRUCTURING 

When the trained analyst examines a musical score he associates certain 
signs to form units and makes a series of basic decisions about the temporal 
spans of such units and their internal structuring. In a metaphorical sense he 
places a template over the score to frame patterns and show how they are inter- 
woven to form local contexts. 

Although the decisions which the analyst makes may rest upon years of 
practical experience, they are often unsystematic and subject to many influences 
which are not easily identified. Nevertheless, it is apparent that at least part 
of the time he makes decisions according to rules of some kind. This suggests 
the possibility that rules could be stated. Such rules, interpreted as algorithms 
and stated in a programming language, would yield a complete and precise 
structuring of the data presented by the score. A tool of this kind should produce 
results at least as good as those produced by the human analyst. It should not 
give provisional readings that require further hand^editing or interpretation of 
some kind. Indeed, it would be appropriate to aim for a reader which would 



4 SECTION II 

produce results superior to those obtained by the human analyst. The use of the 
expression "human analyst" in this context is not intended to imply that the pro- 
grammed score reading process is based upon methods commonly used by 
human beings. The logical base of the program and its empirical relevance to 
a fundamental musical parameter, time, will become evident as we proceed. 

2.3 THE MUSIC INPUT LANGUAGE 

Before processing, scores are encoded by hand in a language designed by 
Stefan Bauer- Me ngelberg [l] in connection with a computer -implemented project 
to print scores automatically. This input language has unique attributes: 

(1) It is isomorphic to standard music notation. 

(2) It is highly mnemonic, hence easy to learn. 

(3) The responsibilities of the encoder are minimal. The encoding 
rules are unambiguous and do not require the encoder to make 
arbitrary decisions. 

(4) It is economical. The amount of code for a complete represen- 
tation is remarkaLly small. (See Figure 1.) 

The language uses a currently non-standard character set of 64 graphics [2], 
but a set of equivalent codes is available when the expanded set cannot be ob- 
tained . Figure 1 shows a page of a modem score and the corresponding code 
in Bauer-Mengelberg's language, prepared for input to the score-reading 

program. 

2.4 THE PROGRAMMING LANGUAGE 

The high level programming language SNOBOL3, designed by Farber, 
Griswold, and Polonsky [3], is used for the program. This language handles 
free strings with facility, and its generality renders it most appropriate for 
the kind of experimentation which necessarily accompanies a research project 
of this kind. The score reading program itself consists of a number of SNOBOL- 
encoded functions. A small amount of basic information is stored in lists, but 
the power of the program resides in the functions. These cope with' any eventu- 
ality in the input string, including encoding errors. 

2.5 OUTPUT FORMS 

The current forms are provisional. An appropriate syntactic structure 
for output will be developed as the reading process is tested and refined. In all 
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Figure 1, Four Pieces for Violin and Piano, Op. 7/l- Anton Webern 
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likelihood the output forms will ultimately employ a subset of the input language, 
since this offers the additional advantage of automatic printing of the results of 
an analysis . 

In the early phases of processing, the forms of the output (intermediate) 
strings are constrained by the syntax of the input language because of the rela- 
tion between input and output strings . The nature of the relation will be made 
clear in the next section, which begins with a summary of terms, conventions, 
symbols, and abbreviations used to describe the parsing process. 
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DESCRIPTION OF TIIE PARSING SYSTEM 



3.1 DEFINITIONS 

The following definitions are adopted for use in describing the parsing 
system. 

3.1.1 Terms 

Character 

Any of the standard characters in six -bit mode. (The extra character ":" 
is used in the intermediate strings.) 

Space -Code 

A sequence of characters in the input language which contributes to the 
specification of pitch. 

Accidental -Code 

A sequence of characters in the input language which, like a space -code, 
contributes to the specification of pitch. 

Pitch -Code 

A space -code followed by an accidental -code. 

Element 

A sequence of characters in the input language representing a distinct 
symbol in music notation. For example, "." represents the dot of music 
notation . 

String 

A sequence of characters which has markers to delimit beginning and end. 

Data String 

The original data before processing: a string in the input language. 

Input String 

A string of characters which is scanned to form a new string. The data 
string is the first input string. 

Intermediate String 

A string, constructed by the program from an input string, which repre- 
sents an intermediate stage in a parsing operation. 

Output String 

A processed string. 
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Position-Value 

An integer in an intermediate or output string which specifies the temporal 
location of an element or sequence of elements. 

Segment 

A distinct unit within an output string, delimited by "(" on the left and ")" 
on the right . 

Block 

A distinct unit within a segment, delimited in the output string by *'P" on 
the left and a single blank on the right. The corresponding unit in the 
input language is delimited by blanks on left and right. 

Parsing 

The extraction, from an input string, of strings of segments representing 
classes of event -configurations in the score. 

Terminal String 

The final output form for a class of segments. 

3.1.2 Symbols 

In the definitions which follow, "A" and "B" name arbitrary intermediate 
strings, and the defined infixes denote relations between, or operations upon, 
pairs of such strings. 

A— *-B A is replaced by B. 

AB Concatenation 

AflB The segment set shared by output strings A and B. 

A^B A has the same unordered contents as B. 

ACB A is a substring (proper or improper) of B. 

ADB A is a superstring (proper or improper) of B. 

3.1.3 Abbreviations 

Special Classes of Strings 

SG Segment 

PSGI The string of primary segments for input. 

PSGT The terminal string of primary segments. 

SSGI The input string of secondary segments. 

SSGT The terminal string of secondary segments. 

RSSGT A replica of SSGT . 

RPSGT A replica of PSGT. 
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Programmed Operations on Strings 

BCP Block-concatenation parsing operation. 

SCP Segment-concatenation parsing operation. 

Grammatical Categories 

These are used for input, intermediate, or terminal strings. The ab- 
breviations which follow will be used for describing the form of strings accord- 
ing to conventions given below: 

AC Accidental -Code 

TC Time-Code 

SC Space -Code 

PC Pitch-class , represented by an integer in the set { , 1 , 2 , ... 

11 } . The expression "pitch-class" or the abbreviation PC 
(after Babbitt [4]) is usually used in the sense "pitch-class 
representative". The meaning is clear from the context. 

PV Position-value 

RC Rest-code, indicating a silence. 

Izi A (literal) blank. 

3.1.4 Conventions 



Names of strings, segments, blocks, functions, function arguments, and 
indices are in upper case alphabetic characters. Usage conforms to the rules 
of the programming language: 

(1) Indices are separated from generic names by "." 

(2) Literals are enclosed within single quotation marks (' '). 

(3) Function arguments are enclosed within parentheses, and multiple 
arguments are separated by commas. 

Examples: 

A String name A. 

A.I String name with (variable) index I. 

A . '1 ' String name A with index 1 . 

B(X,Y) Function name B with arguments X and Y. 

Variables used as indices, such as I and J, do not imply anything about 
sequence, but merely indicate distinctness. 
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The Grammatical Categories listed previously are used in this paper to 
specify the form of a string as follows: If X and Y are grammatical categories , 
then the expression 

XY 
designates the result of writing, in the input language, a representative of the 
grammatical category X followed immediately by a representative of Y, with no 
intervening space . Occasionally literals are used where the string form described 
contains invariant items; for example, the expression 

PV '$' 
displays the form of a string consisting of some PV followed immediately by the 
literal dollar sign. 

3.2 PRELIMINARY OPERATIONS 

For the remainder of the paper a single composition will be used to illus- 
trate the work done by the program. This is shown in Figure 2 , together with 
the corresponding data string encoded in the input language . Two basic operations 
are performed on the data string before the main parsing begins. 

(1) Since the encoding is basically a one-pass procedure, the encoder pro- 
gressing from left to right across the page for each instrumental part, the char- 
acters which belong to a particular instrument are distributed throughout the data 
string. The program therefore begins by scanning the data string to assemble 
the complete code-sequence for each instrument. It must deal with such situa- 
tions as the following: In an orchestral score of 200 measures the bass clarinet 
enters on the last beat of bar 75, plays a staccato thirty-second note, and is not 
heard from again during the remainder of the piece. In this and all such cases 
where the part is not present either at the beginning or end of the score, the pro- 
gram finds the complete instrumental part. 

(2) The second operation establishes the basis for the derivation of complex 
event -configurations by the application of the parsing rules. This consists of the 
assignment of a position-value PV to each code which implies a temporal position. 
Two passes through the input string are required. On the first pass information 
relevant to the duration-value of time-codes is accumulated, hi particular, a 
normative or list value is assumed for each time-code. This value is adjusted 
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if a o-roupctte occurs in the same l)ar. For example, if the meter signature is 
2/4, an eighth note triplet is a groupctte and indicates three eighth notes in the 
(normative) time of two. For each bar a numJxn- is computed which represents 
the product of all groupette divisors. The number is one if there are no group- 
ettes in any part in a iiarticular bar. On the second pass a duration-value is 
assigned to each time-code by consulting the list of normative values and the 
numl^er which represents the product of the groupette divisors. The duration- 
value DUHVT for groupette members is them computed by the SNOBOL statement 

DURVL- (VL * GP2) - (Z/GP1) 

where YL is the list value of the time-code, GP2 is the normative division of 
that value, Z is the groupette product, and GPl is the groupctte divisor. For 
all other time -codes in a bar which contains a groupette or groupettes, 

DURVL= VL * Z 

For bars in which groupettes do not occur, the list value is the duration-value 
assigned. 

Once the position-values have been assigned — and this is straightforward 
after the duration values have been computed — position-value becomes the main 
parameter in the parsing since it indicates temporal position. The string can 
then be viewed as a succession of blocks within segments, each block containing 
the total of codes aKSociated with the point in time represented by the position 
value . hi this way it is possible to refer to any moment in a composition and to 
relate any event to any other event with respect to a time -continuum . In this 
regard as well as many others, the analytic technique differs fundamentally from 
those employed by researchers in other fields where the string is the basic data 
structure. The structural linguistsr, for example, arc not concerned with time- 
placement in this extended sense, v^hereas for music research the temporal 
dimension is always prime [r>]. 

The association of each event and each event-configuration with a set of 
position-values has the additional advantage that order characteristics are pre- 
served. Thus, complexities of rhythmic interaction can be deduced from the set 
of numbers attached to particular configurations. For example, collation functions 
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can be written to search for categories which might tae of special interest, such 
as all segments which have the same first position value. 

3.3 PARSING RULES 

3.3.1 General Comments 

For the description of the parsings, information falls conveniently into these 
categories: 

(1) Names of the strings being parsed. 

(2) An informal description of the parsing technique and a characterization 
of the event-configurations which it extracts from the input string. 

(3) A symbolic expression representing the form of the segment in the 
output string. 

(4) A formal statement of the parsing rules. 

(5) A description of the deletion conditions for segments in the output 
string. 

(6) niustrations of program-generated output, music notation, and scanning 
strategy. 

3 .3.2 Primary Segments 

Underlying any segmentation or parsing process are certain assumptions 
regarding the discreteness of events. What conditions determine a unit? Which 
classes of signs serve as disconnectives? How can one speak with precision of 
components of larger event-configurations? Questions such as these are in the 
foreground of the description of parsing rules, and an effort is made to state the 
answers clearly . 

For th(> first parsing of the input string, each substring representing a 
complete instrumental part is scanned to extract all primarv^ segments. A pri- 
mary segment is a continuous unit delimited by the graphic sign denoting silence: 
a rest in musical notation, RC in the input language. It begins with a PC which 
has been preceded by a rest (or by an understood [logical] rest if the pitch is the 
first in a part), contains no internal rests, and ends with a rest. Thus, it is set 
off from other substrings and is regarded as a disjunct sequence in a one- 
dimensional environment. The scan for primary segments in a instrumental part 
terminates when the input language code "//" (double bar) is encountered. 
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In the injml Language, a primary segment has the form 

RC "i^ SC AC TC SGb' KC 16 (^) 

With rasped to sequence in the string th<^ block KC AC TC always follows the RC . 
Since ther-e may be more than one RC in a substring of the input string without an 
InLervening SC AC TC-i"or example 

RC RC RC SC AC TC SC RC RC RC 

-the first RC in the form given above is always the rightmost with respect to 
AC, while the second RC shown in the form is always the leftmost with respect to 
SG. 

After parsing, the form of the segment (now in an intermediate string) is 
'*' PC ^KA' PV '$' ... 'KZ' PV '$' (2) 

where 'KA' and 'KZ' designate attack and release, respectively, and refer to the 
position -value which follows. In the case of 'KA' the position -value always refers 
to the first (attack) PC . In the case of 'KZ' the position-value refers either to 
the terminal delimiter (some RC) or, if the next code is '//', to some code which is 
an understood (logical) RC. See the viola part of Figure 2 for the latter. 

The form given in (2) above is the input form for a primary segment. The 
terminal form is: 

.^, ip! tyT fil p(-^ '*' PC , . . '*' PC ')' (3) 

The transformation from input form to terminal form involves two kinds of 
change: deletions, and deletions with replacements. Characters which are de- 
leted (without replacement) are 'KA', 'KZ', 1^, PV (when preceded by 'KZ'), 
groupette definers, and meter signatures. Each pitch-code is replaced by an 
integer representing a pitch-class PC . The appropriate mapping is executed by 
a function PCMAP, which utilizes modulo 7 arithmetic. A number of contextual 
details require attention here. First, if the instrument is a transposing instru- 
ment, the appropriate modulo 12 arithmetic must be performed. Second, a scan 
is made for space-codes which have implicit accidentals, under two familiar con- 
ventions of notation, and those accidentals are inserted. Third, the current clef 
sign must be made available for each pitch-class mapping. Figure 3 shows the 



DESCRIPTION OF THE PARSING SYSTEM 



15 





(P0*1 Plfe*-? )(P0*0 MP0*4 )(P48*? P64*5 P80»3 ) {Pf,'**'* P9b*U )(P6'f*ll P96 
♦11 }(P10^*5 P12P*^ )(P1?0*6*10 P128*6*10 )(Pl20*ll P!28»ll )(P144»0 ){P 
160*8 P192*8 P?6A»10 )(P160»7 )(P168*6 Pl76»6 )(P192*9 MP224*7 P240*7 ) 
(P27f#fl ](p-:i'[p*/, )(P77f»^ )(p^s?»5 ItP-^St*? Pi08*l 1(P4^2*0 P480*0 )(P46 
4»n Pif80»ll P488*ll )(P4eO*6 MP512»8 )(P5l2*9 )(P544*7 )(P548*8 )(P568 
•^ )(P568*11 )(P56a«l P576*2 »(P568*4 I 



Fig'ure 3. The Terminal String of Primary Segments (PSGT) 
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conii)lcte siring PSGT, together with the music notation where primary segments 
(31 in all) are indicated by parentheses. 

3 .3.3 Second ary Se'gments 

The string oi' primary segments is made up of substrings, one for each in- 
strumental part. Every (unordered) pair of these substrings is scanned for sec- 
ondary segments. This is the first and most important step in the progression 
from the one -dimensional primary segments to a representation of the analyzed 
two-dimensional score . At this stage we are concerned with the ways in which 
events represented by pairs of primary segments interact in local contexts. In- 
deed, one of the important consequences of the parsing for secondary segments 
is to show how the primary segments are affected by motion in other parts of the 
context. The following combinatory situation provides the basis for a formal 
description of the parsing. 

Given three temporal relations: less than (1:)cfore), equal to (coinciding), 
and gr eater than (after), together with the position-values of the attack (A) and 
release (R) for each of two primary segments SO. I and SO. J (where SO. I C 
PSGT. I and SG.J C PSGT. J), there are (3 + 4) - 2 ways in which the two seg- 
ments interact, excluding the non -contiguous cases. Figure 4 shows the 11 pos- 
sible interactions reduced to fi classes by virtue of the symmetry of the defined 
relations. Each class is assigned a mnemonic name in quasi -Polish notation; 
for example, EAR means that the relation equal to obtains between both A's and 
both R's, while LAER means that the first A is less than (before) the second, 
while the two R's coincide. The same name is applied to the second (symmetrical) 
relation in each class and a subscript is used to distinguish the two. 

As indicated above, a given pair of primary segments is scanned (left to 
right) for Al , Rl , A2 , and R2 . Figure 5 is a graph of the decision structure 
which is traversed to determine the particular relation between the pair. 

When the relation has been determined, one of four functions is called and 
secondary segments are formed as shown by the arrows in Figure 4. Observe 
that the basis of segmentation is attack and release operating reciprocally on the 
constituent primary segments. Since the disconnectives (attack and release 



DESCRIPTION OF THE PARSING SYSTEM 17 



EAR 



EAGR 



LAEP 



GAR 



LAGR 



SG. I 
SG.J 



SG.J 



51 



Al 



A2 



52 



SI 

SG.I Al ^Pl 



A2 R2 

52 



-4- 

A2 R2 

S3 54 



PI 



R2 



A2 R2 



51 S3 

SG.r Al PI 



SG.J 



SI 52 

SG.I Al Rl 



SG.J 



SG. 1=5152 



SG.I = 51 52 



5G.1 = 51 
SG.2 = S3 
SG.5 ^ SI S2 



SG.I = SI 
SG.2 = 52 
SG.5 = 52 S3 



A2 R2 

53 
51 S2 

SG.I Al PI SG.I r 51 

^^ ? * SG.2 - S2 

' ' SG.3 = S3 

SG.J ^ j- ^ SG.4 = S4 



SG.5 = SI 54 



SI 52 53 Rl 

SG.I tl ^ , , SG.I = SI 

^ ^ 5G.2 = 52 

i 5G.3 = S3 

SG.J \ f SG.4 = 5 2 54 

A 2 P2 

S4 



Figure 4. PoKKible Interactions for Two Segments 
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markers) of the original primary segments now themselves point to other dis- 
connective s, it is evident that the parsing is recursive in the limited respect. 

The scanning strategy is straightforward: If k is the number of substrings 
in the string of primary segments -each substring representing an instrumental 
part-then the number of scans required to find all secondary segments is the 
same as the number of distinct pairs in the unordered cross-product of a set of 
k elements 

(k(k+l)/2)-k=k(k-l)/2 

Each scan of a pair of strings of primary segments yields a string of secondary 
segments (possibly null). The name of this string is SSGI.JK, where the vari- 
able index J is the index of one member of the pair of primary segments, and K 
is the index of the other. Figure 6 shows the complete output string SSGT . 

The scanning strategy for the deletion of duplicate secondary segments 
utilizes the fact that for any pair of strings of secondary segments SSGI. J and 
SSGI.K, where J and K represent concatenated indices of primary segments as 
described above, duplicate segments will occur only if one element of the com- 
pound index J is the same as an element of the compound index K. This strategy 
greatly reduces the number of scans required and permits the operation to be 
controlled by a non-arithmetic program segment in a fashion idiomatic to 
SNOBOL. The program segment which performs this operation is shown in 
Figure 7 . 

3.3.4 BCP (SSGT) 

A secondary segment results from the interaction in time of the attacks 
and releases of two primary segments. Possibly a particular secondary segment 
is a constituent of a still larger event -configuration; that is, it may be associ- 
ated with other events in the local context. The parsing function BCP (l^lock 
concatenation parsing) finds such situations and derives a new class of segments, 

BCP operates on SSGT and its replica RSSGT as follows. Each block in 
each segment of SSGT and RSSGT has the form 

tpi pv ^*' PC. '1' '*' PC. '2' '*' . . . PC.nW 
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Figure 6. The Terminal String of Secondary Segments (SSGT) 
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Figure 7. Program Segment for Duplicates in SSGT 
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For every block B.I in a segment SG.l (SG.I C SSGT), with PV.I, and every 
block B.J in SO. J (SG.JC RSSGT), with PV.J, 

if PV.I =^ PV.J 
then 15.1-^ H.I B.J 

In this way, a secondary segment SG.I is expamied by infixing all blocks in all 
other segments which intersect in time with some block in SG.I. Two ;idditionaI 
aspects should be noted. First, concatenation is followed by the deletion of du- 
plicate IKVs. Second, the process is not recursive, in the sense that only the 
original blocks in a segment are of^e rated on; concatenation does not extend to 
blocks derived from the parsing. 

Possibly some secondary segment does not contain any PV block which 
intersects with the PV of some other block in the way defined. In this case the 
segment is complete in itself and docs not belong to a larger context. Such resi- 
dual segments are deleted when the execution of the function has been completed. 
It is also possible that a segment in the parsed string will have one or more du- 
plicates. These internal duplicates are deleted after RCP has been executed. 

Because the concatenation does not take into consideration the order of 
Iiosition-valucs or the order of jjitch -class representatives, it is necessar>' to 
call collation functions within RCP. I^tch-classes are placed in ascending 
order at present. However, the set of indices representing original ordering 
could be retained, wherever that ordering is significant. Figure 8 shows the 
output string R(' P(SSC/[') and the corresponding notation, with segments tlelimited 
by frames . 

3.3.5 HCP(PSG T) 

Wlien RCP operates on PSGT the result is a string each component of which 
is anchored to a position-value which occurs in two or more instmmcnts. It 
should be remarked here that the "instrumental part" is merely a device. Once 
the position-values have been assigned, the location of an element within a parti- 
cular part is of no fundamental concern. The reading thus is independent of the 
notions of "part," "voice," "voice-leading" and other interpretive formulations 
which belong to a higher anal}4ic level. 
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Figure H. Block-Concatenation ParKinj]^ of SSGT 
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Deletion conditions for BCP(PSGT) are as follows: residual primary seg- 
ments are deleted, and internal duplicates are deleted. Since the rules for BCP 
when applied to PSGT may form duplicates of secondary segments, a scan must 
be made to delete such duplicates. Figure 9 shows the complete output string 
BCP(PSGT) and corresponding notation. 

3.3.6 SCP(SSGT) 

The content of segments derived by BCP is limited with respect to span 
since the concatenation is restricted to units of block to event -configurations of 
greater span since it defines the segment as the concatenation unit. 

For every PV.Iin SG.I (SG.ICSSGT) and every PV.J in SG.J (SG.JC 
RSSGT), 

if PV.I=PV.J 
then SG.I-^SG.I SG.J 

As remarked above, SCP finds event -configurations which are connected by {at 
least) one cbmmon PV. The SCP-derived segments thus represent continuity 
over longer spans, even though disjunctions may occur in some subparts of the 
musical contexts As a consequence SCP yields very intricate and subtle readings. 

The scanning procedure for SCP differs significantly from the scan for 
BCP, In the case of BCP the scan for a new segment always begins at the head of 
the string. This is not true for SCP. Here the first PV in each segment is a 
potential prime generator of a segment in SCP. If a particular first PV has not 
previously been a prime generator it is qualified to become one. Let us assume 
that some first PV, PV. '1' is a prime generator. The scan to find segments for 
concatenation begins with the first segment to the right of SG. '1', SG. '2' in 
RSSGT. If the scan finds a match for PV. '1' in SG. '2' the entire segment is con- 
catenated with SG.'l' after duplicate blocks have been removed. The PV. '2' in 
SG. '1 ' then becomes a secondary generator; the scan begins again with the first 
segment to the right of SG. '2' in RSSGT, and so on. Possibly the segment which 
is concatenated contains position-values which are not in the original SG. '1'. 
Such position- values become secondary generators. 
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Figure 9. Block-Concatenation Parsing of PSGT 
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In this way, links arc formed to other segments and the new segment in 
SCP is expanded until there are no position -values for which a complete scan has 
not been made. Thus, unlike BCP, the scan for SCP is ordered: there is no re- 
tracing and a prime generator heads only one segment. The characteristic nest- 
ing of segments generated by SCP is evident in Figure 10, which gives the com- 
plete output string and a partial representation of the corresponding notation for 
SCP{SSGT). This output string contains eleven segments in all. Because of 
density, however, it is not feasible to give all output information in notation. 
Figure 11 gives SCP(PSGT) in its entirety. 

3.4 SUMMARY: STRUCTURE OF THE PARi^ING SYSTEM 

From the preceding descriptions of the parsings which make up the system 
it should be evident that the system is hierarchic, in the initial sense that pri- 
mary segments are formed before secondary segments. There is also a hierarchic 
relation between the function BCP and the function SCP. Specifically, if we regard 
the function calls as string names: (1) Every SG in BCP(PSGT) is a subsegment 
(possibly improper) of some segment in SCP{PSGT); (2) Every SG in BCP(SSGT) 
is a subsegment (possibly improper) of some SG in SCP(SSGT). Moreover, the 
nesting of BCP or SCP, or BCP and SCP does not produce additional distinct 
strings of segments. For example: 

SCP(PSGT) D SCP(BCP(PSGT)) 
SCP (SSGT ) D SCP (BCP(SSGT )) 
BCP(PSGT) D BCP(SCP(PSGT)) 
BCP (SSGT) 3 BCP(SCP(SSGT)) 



Also, 



and 



BCP(SSGT, PSGT) D BCP(PSGT) n BCP(SSGT) 



SCP(SSGT,PSGT) D BCP(SSGT) fl SCP (SSGT) 

Finally, every SG in BCP(SSGT, PSGT) is a segment in BCP(PSGT) or BCP(SSGT), 
and every SG in SCP (SSGT, PSGT) is a segment in BCP(SSGT) or SCP(SSGT). 
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Figure 10. Segment -Con catenation Parsing of SSGT 
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Figure 11. Segment -Concatenation Parsing of PSGT 
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Thus, the most efficient (in the sense of shortest scan) and complete set of 
distinct parsings possible within the system is represented by the following six 
names: 

PSGT 

SSGT 

BCP(PSGT) 

SCP(PSGT) 

BCP{SSGT) 

SCP(SSGT) 

As the output strings are generated in the above order, the schema of 
string relations for the deletion of all duplicate segments follows accordingly: 

IN DELETE DUPLICATE SEGMENTS OF 

PSGT PSGT 

SSGT SSGT PSGT 

BCP(PSGT) BCP(PSGT) SSGT PSGT 

SCP(PSGT) SCP(PSGT) BCP(PSGT) PSGT 

BCP(SSGT) BCP(SSGT) SSGT 

SCP(SSGT) SCP(SSGT) BCP(SSGT) SSGT 
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FROM SCORE READING TO HIGHER ANALYTIC LEVELS 

After a reading has been obtained, the question ariscH: How is the output 
to be interpreted at a higher level ? A eomprehensive answer to this question 
would exeeed the scope of the present report. Nevertheless, some indication of 
the nature of further analytic operations can be given. 

A sequence of linked programs in the MAD programming language has been 
written to process the output from the score reading program in order to yield 
increasingly higher representations of structural relations. If the collection of 
pitch-class representatives in each segment of the output strings is scanned to 
remove duplicates, the resulting collection maybe called a compositional se t. 
Then, from a listing of all the compositional sets, the analysis programs: (1) 
determine the class to which each set belongs; (2) list and count all occurrences 
of each set-class represented; (3) compute, for each pair of set-class represen- 
tatives, an index of order similarity; (4) determine the transposition-inversion 
relation for each pair of set -class representatives; (5) list, for each set-class 
represented, those classes which are in one of three defined similarity relations 
to it and which occur in the work being examined; (6) summarize in matrix format 
the set-complex structure of the totality of classes represented in the work; (7) 
accumulate and retrieve historical and other informal comments in natural 
language . 

These programs thus permit environments to be examined for similarities 
and differences and for distributional characteristics which might suggest gen- 
eralizations about the structure of the particular work or about the structural 
determinants of the entire class of compositions to which the work belongs. To 
illustrate this, let us consider certain aspects of the pitch -structure of two con- 
tiguous contexts in the Webern work parsed by the reading program. 

The first context, extending from PV = 160 to PV — 2 76, is represented in 
Figure 12. The position-values are arranged from left to right at the top of the 
illustration. Each row below then gives the pitch-class repf^sentatives for one 
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15 9 7 7 2-2 

16 8 10 2-2 
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20 7 7 8 2-1 
21 

BCPCPSGT) 
22 7,8 

23 
BCPCSSGT) 
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25 6 6 8,9 3-2 

26 8,9 7 7 3-1 

27 8,9 8 2-1 
SCP(SSGT) 

28 7,8 6 6 8,9 7 7 10 8 5-1 
29 
30 
31 7 7 10 8 3-2 
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Figure 12 . Pitch Structure from PV =160 to PV= 276 
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segment. The name of the string from which the segment derives is indicated 
by the headings at the left margin. The name of the set-class to which the com- 
positional set corresponding to the segment belongs is given in the rightmost 
column. The first integer gives the cardinality of the set, the second gives the 
ordinal position on a stored list. Criteria of class membership are based upon 
interval content and will not be explained here [6]. (Since a set containing one 
element does not form an interval, it is not distinctive with respect to interval 
content. ) 

The largest set in Figure 12 is the set numbered 28; class 5-1. It is evi- 
dent by inspection that all the other sets are subsets of this one and therefore 
all components of the context belong to a single set-complex. It should be said 
that the determination of set-complex structure is not as easy as this in other 
cases. In all, seven set-classes are represented: 5-1, 4-1, 3-1, 3-2, 2-1, 2-2, 
and 2-3. The theoretical set-complex about 5-1 contains 15 set-classes. 

The second context is displayed in Figure 13. There the largest set is 
numbered 26: set class 6-1. The number of set-classes represented is nine: 
6-1, 5-1, 3-1, 3-2, 3-5, 2-1, 2-2, 2-5, and 2-6. Ml but two of these belong 
to the set-complex about class 6-1. The two referred to are 3-5 and 2-6. Set 
2-6 is a subset of 3-5, which first occurs in the output string BCP(PSGT) and can 
be seen in the score as the simultaneity at bar 10. Thus, within the context there 
is a distinct division into two parts on the basis of set-complex structure, a divi- 
sion which is signalled by an entrance of pitch-name C at PV = 432. 

A comparison of the two environments shown in Figures 12 and 13 reveals 
that all the sets in the first are transformed subsets of set number 26; in the 
second, set class 6-1. This provides a basic and audible measure of change 
since the event -configuration which begins at PV —312 is a continuation of the 
intervallic complex of the preceding context. The next distinctive event, from 
the standpoint of pitch -relations, is the formation discussed above: the pitch- 
class collection 0, 6, 11, which is a member of class 3-5. 

Many detailed contextual associations are revealed by interpretation beyond 
the reading stage. In Figure 14, (A) shows a secondary segment distributed be- 
tween viola and cello. When this is compared with the primary segment at (B) the 
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two are seen to be related: they are representatives of the same set -class. The 
first (X) is a transposition of the second (Y) at the interval of eleven half-steps. 
The relation is nontrivial, for the two notes F and Eb that sound with the E in X 
are elements of Y. The association of the two contexts is further strengthened 
by the simultaneities V and W. Again, these are members of the same class: 
V is a transposition of W at the interval of seven half-steps. 

It is hoped that this necessarily incomplete treatment demonstrates the 
way in which a basis for a still higher level of analysis can be constructed in 
terms of a system of pitch and interval relations interpreted with the aid of 
theoretical concepts. 
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(B) Primary Segment 




(A) Secondary Segment 

Figure 14. The Association Between Two Contexts. 




pl^Af^V i.£i-/1£NT 
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Figure 15. Extending the Parsing System Detail 
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SECTION V 

EXTENSIONS AND IMPLICATIONS 

A simplified input strinji,' was used in order to facilitate development of the 
parsing program. This limitation in no way affects the generality of the system. 
Any syntactic class of interest to the analyst can be associated with a position- 
value , and an appropriate output format can be designed to accommodate as many 
properties of event -configurations as desired. 

Since the parsing rules are based on the occurrence of syntactic structures 
in the input language (which is capable of representing every syntactic class), it 
is possible to extend the system in many ways. It could be extended, for exam- 
ple, with respect to depth of detail, as explained below. 

In Figure 15, the primary segment, delimited by the bracket below the 
staff, contains two subsegments of interest: (A) and (B). In particular, (B) is 
delimited by the slur, with end points marked by "*". In the corresponding input 
language statement, given below the staff, these end points are represented by 
"LI". From the standpoint of programming, it would not be difficult to write a 
parsing function, modelled on the parsing for primary segments, to isolate all 
such subsegments marked by slurs within prinaary segments. Any syntactic 
class or combination of classes could be defined as delimiters in a similar way, 
thus creating new analytic strata. It might be pointed out in this connection that 
aspects of individual composition "style" could easily be investigated to any depth, 
provided, of course, that the researcher can specify the syntactic conditions with 
sufficient precision. 

The second possibility for extending the program is more sophisticated 
and correspondingly more difficult: that of providing an inductive capability such 
that scans would be modified under certain conditions to consider other possible 
parsings or to scan for certain structures conditionally. This heuristic facility 
would greatly increase the power of the program, and would make it possible to 
investigate context- sensitivity in greater depth. 



36 SECTION V 



In conclusion it should be noted that the score -reading program suggests 
that a feasible long-range project would be the development of a precise descrip- 
tive language for event -configurations. Such a language vi^ould be valuable for 
the study of musical statements in general and would be especially suitable for 
computer -implemented studies. Much work remains to be done. 
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